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Abstract 



We consider a generalized processing system having several queues, where the available service rate 
combinations are fluctuating over time due to reliability and availability variations. The objective is 
to allocate the available resources, and corresponding service rates, in response to both workload and 
service capacity considerations, in order to maintain the long term stability of the system. The service 
configurations are completely arbitrary, including negative service rates which represent forwarding and 
service-induced cross traffic. We employ a trace-based trajectory asymptotic technique, which requires 
minimal assumptions about the arrival dynamics of the system. 

We prove that cone schedules, which leverage the geometry of the queueing dynamics, maximize the 
system throughput for a broad class of processing systems, even under adversarial arrival processes. We 
study the impact of fluctuating service availability, where resources are available only some of the time, 
and the schedule must dynamically respond to the changing available service rates, establishing both 
the capacity of such systems and the class of schedules which will stabilize the system at full capacity. 
The rich geometry of the system dynamics leads to important insights for stability, performance and 
scalability, and substantially generalizes previous findings. 

The processing system studied here models a broad variety of computer, communication and service 
networks, including varying channel conditions and cross-traffic in wireless networking, and call centers 
with fluctuating capacity. The findings have implications for bandwidth and processor allocation in 
communication networks and workforce scheduling in congested call centers. By establishing a broad 
class of stabilizing schedules under general conditions, we find that a scheduler can select the schedule 
from within this class that best meets their load balancing and scalability requirements. 

Keywords: random environment, stability, adversarial queueing theory, dynamic scheduling, through- 
put maximization. 

1 Introduction 

We consider a processing system comprised of Q infinite capacity queues, indexed by g £ Q = {1,2, ...,Q}, 
operating in a time-varying environment which fluctuates amongst environment states e G £ = {1,2, ...,E}. 
In each environment state, only a subset of the service configurations are available. The process scheduler 
selects a service configuration vector S = {Si, ■ ■ ■ , Sq) from the environment-dependent available set S'^. 
Upon selection, if Sq > then queue q is emptied at rate Sq, and if Sq < then the queue is filled at the 
corresponding rate. The available service configurations can be completely arbitrary, including vectors with 
any combination of positive and negative components. 

A key question addressed in this study is which of the available service configurations should be 
selected, given the system workload and environment state histories, so as to maximize its throughput. We 
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introduce a family of resource allocation policies - called Cone Schedules - which are shown to stabilize the 
system under the maximal possible traffic load, even if that load is designed by an adversary to destabilize 
the system whenever possible. 

This canonical processing model captures several applications in computing and communication sys- 
tems, including wireless networks, packet switches and call centers. The main characteristic of these applica- 
tions is that the service rates across multiple queues are coupled through operational constraints, giving rise 
to the available service configurations. Service rate availability (corresponding to the environment states) 
is affected by staff scheduling in call centers, congestion dynamics in wireless networks and scheduled or 
unscheduled outages due to maintenance or reliability issues in other processing systems. 



1.1 Related Work 



The trace-based stability analysis technique employed in this paper relates to the study of adversarial queue- 
ing networks exemplified in [Andrews et al., 200T\ and flBorodin et al., 200T[ . This approach avoids im- 



posing a probabilistic framework on the arrival traffic, and instead analyzes the peifomiance of a queueing 
network under and adversarial arrival traffic trace, designed to stress the system as much as possible. They 
describe a queueing network as universally stable when they can show that the total workload of the system 
is bounded under any deterministic or stochastic adversary's arrival trace. This work is really finding the 
worst-case behavior of a network by considering the network to be a game between the schedule (protocol) 
and the worst possible aixival trace (adversary). They limit the absolute arrival volume within a finite inter- 
val, but do not require it to follow any stationary distribution or apply any further restrictions. This concept 
builds upon earlier work called leaky-bucket analysis in |Cruz, 1991a| and |Cruz, 1991b| . 



Adversarial models have been used to in packet networks before, such as | |Borodin et al., 200T] | which 



considers a fixed-path packet network. Some more general queueing systems, including multiclass queue- 
ing networks are studied in [ Tsaparas, 2000) , with generalized service times and heterogeneous customers. 



Adversarial methods have also been employed to study multi-hop network stability in |Kushner, 2006| . In 



I Anshelevich et al., 2002| , adversarial models are used to analyze load-balancing algorithms in a distributed 
setting based using a token-based system on a network with limited deviations from the average load. While 
none of these study the same network scheduling setting of this paper (to our knowledge they have only 
considered fixed-path networks under time-invariant service environments), each example presents a per- 
suasive argument for the value of network stability analysis in the absence of a well-defined probabilistic 
framework. 

A special example of the system described in this paper is a single crossbar packet/cell switch with vir- 
tual output queues, used in high speed IP networks. The switch paradigm is the focus of |Ross and Bambos, 2009| , 



and provides a helpful context to develop the cone algorithms. In this switch, cells arriving to each input port 
get buffered in separate virtual queues, based on the output port they are destined to. The switching fabric 
can be set to a different connectivity mode in each time slot, matching each input port with a correspond- 
ing output port for cell transfer. In this context. Maximum Weight Matching (MWM) has been shown in 
I McKeown et al., 1999| | to maximize the throughput of input queued switches, employing Lyapunov meth- 



ods for stability analysis, as also in constrained queueing systems studied in | Tassiulas and Ephremides, 1992 



Tassiulas, 1995| Tassiulas and Bhattacharya, 2000[ |Hung and Michailidis, 2011 1. In our more general ser- 



vice model, MWM corresponds to maximizing (5, X) = SqXq, where the weight Xq is the cell work- 
load of queue q or a related congestion measure, and the S vectors represent the crossbar configurations. 

More general results on the stability of MWM algorithms, using fluid scaling methods, were later 
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obtained in [Dai and Prabhakar, 2000] , and on a generalized switch model in |Stolyar, 2004 1. Stability in 
networks of switches was studied in |Marsan et al., 2005] and |Leonardi et al., 2005| . | |Dai and Lin, 2005| 



and |Dai and Lin, 2008| considered maximum pressure policies by modeling fluid flows for types of pro- 
cessing networks. Their work can be seen as a generalization of the policies which maximize J2q'^g-^q 
where some of the service rates are negative because the available configurations involve forwarding work- 



load from one queue to another downstream queue. |Neely et al., 2003 1 studied broader optimal controls 



for generalized (wireless) network models that involve joint scheduling, routing and power allocation. All 
of these have significantly advanced the theory of the stability of scheduling rules which allocate service to 
queues based on a weighted-matching approach, and utilize a probabihstic framework to apply fluid limit or 
heavy-traffic analysis. 

Instead of using fluid scaling methods (primarily analytic, involving passage to a limit regime) to 
establish the results, we opt to use an alternative direct and primarily geometric approach in this work, 
which seems to have broader applicability to other queueing systems and reveals useful geometric in- 
sight regarding their dynamics. The trace-based asymptotic analysis employed here was introduced in 



[Armony andBambos, 2003 1, where the maximum weight matching algorithms were studied and it was 
shown that maintain maximal throughput is guaranteed under very general arrival process assumptions. The 



method was also employed in |Bambos and Michailidis, 2004[ where randomly fluctuating service levels 
were studied. In that case the service rate assignments are made without full knowledge of service avail- 
ability, as opposed to the processing systems studied here where service allocation decisions are made in 
response to availability. Like the adversarial queueing models, there is no probabilistic framework required, 
but unlike the traditional adversarial models, there is also no short-term restriction on arrival bursts in finite 
time, but just a long-term traffic load restriction. This leads to more general stability results, but eliminates 
the possibility of tighter bounds on other performance metrics. For example under such general assumptions 
there can be no guaranteed finite bound on the total workload, or even the expected workload in the system. 



1.2 Results Overview 



We classify the stability region for these processing systems with fluctuating service availability. We find 
that rate stability for these general processing systems can be guaranteed by the class of cone schedules, 
for any arbitrary arrival process that can possibly be stabilized. Cone schedules use the available service 
vector with maximal projection (5, BX) = SpBpgXg on the projected workload vector BX, for 

every matrix that is positive-definite, has negative or zero off-diagonal elements, and is symmetric. This 
substantially generalizes a similar- result in | |Ross and Bambos, 2009 1 , where the same class of algorithms 
was shown to maximize throughput for the special case of packet switches. 

In classifying the stability region, we show how the combination of service vectors in each environ- 
ment impacts the overall capacity of the system, beyond the long term availability of each service vector. 
The geometric framework for stability aids the intuition and analysis significantly. Because of environment 
fluctuations, one may expect that a scheduling rule needs to account for future and past states. However we 
find that cone schedules, which respond only to the current workload, are able to guarantee stability for any 
arrival rate within the stability region. 

The service rates in this paper are allowed to be completely arbitrary, in contrast to previous results 
using the trace-based analysis which only applied to positive-service switches. This captures cross-traffic 
and forwai^ding between queues, because the selected service vector may induce additional workload to the 
system, in addition to the external arrival process. Further, in this work time is continuous, and ai^bitrarily 
large arrival bursts can be handled at arbitrarily small time intervals. This is more general than previous 
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models where arrivals and decisions were restricted to timeslots. 

From an architectural point of view, the geometric approach to the scheduling problem provides key 
practical design leads. Specifically, the conic representation (Section |4| of cone schedules leads to scalable 
implementations in switching systems. Further, varying the elements of matrix B, we can generate a very 
rich family of cone schedules that implement a soft coupled priority scheme (and coupled load balancing) 
across the various queues, managing delay tradeoffs between them. The schedules ai^e also robust to any 
sublinear perturbation such as delayed or flawed state information. 

The remainder of the paper proceeds as follows. In Section |2] we introduce the model and system 
dynamics. Section |3]describes the throughput capacity or stability region of these networks, and in section|4] 
we introduce the family of Cone Schedules and their geometry. Stability and performance implications are 
discussed in sections [5] and [6] respectively. We conclude in Section |7] 



2 The Processing Structure 

Let /q Aq{z)dz be the total workload that arrives to queue q in the time interval (0, t\; that is, Aq{t) > 
is the instantaneous workload arrival rate at time t > 0. The traffic trace Aq = {Aq{t),t > 0} is a 
(deterministic) function, which may have discontinuities and even S-jumps for each q £ Q. The overall 
(vector) instantaneous traffic rate is A{t) = {Ai{t), A2{t), Aq{t), ...,AQ{t)) at time t > and the traffic 
trace is A = {A{t),t > 0}. We assume that the (long-term) traffic load of the trac^ A, 

Jo M^)dz ^ ^^^^ ^ ^2.1) 

is well-defined on the traffic trace A. Correspondingly, we define the set of traffic traces of load p G M^^, 

2t(p) = (a = {A{t),t > 0} : lim kA!^ = p\ , (2.2) 

restricting our attention in this paper to traffic traces of well-defined load. A variety of natural arrival 
processes are included. For example, Aq{t) = Yl'jLi'^g -'-{t=t^} models jobs of service requirement aq 

arriving at times > to queue q. In this case, Aq (t) is zero between consecutive J-jumps. In general, there 
could be positive instantaneous workload arrival rate between consecutive 5-jumps, which would represent 
a continuous inflow of work. 

No further restrictions are placed on the arriving traffic trace. It may be generated by an underlying 
stochastic process, or even an adversary specifically designed to destabilize the system whenever possible. 

The aniving workload is queued up in the queues q £ Q, which ai^e assumed to be of infinite capacity. 
Let Xq{t) be the workload (total workload or sei^vice requirement) in queue q at time t > and 

x{t) = iXi{t),X2{t),...,Xq{t),...,XQ{t)) eR'^^, 

the overall (vector) workload. 



"^Throughout this study we employ the notation = {1,2,3, •■•}, Zo+ — {0, 1, 2, • ■ ■ }, R = (—00,00), 
(0,00), Ro+ = [0,00) 
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The processing system operates in a fluctuating environment, which can be in one of E distinct states 
at any point in time, indexed by e G <f = {1, 2, • • • , E]. Let e{t) ^ Eht the environment state at time t and 
E = {e(t), t G M} the overall environment trace over time. It is assumed that the proportion of time the 
environment trace E spends in each state e G is well-defined, that is, 

Iimi^^Mfbf}^=vr^(E) 

t— >oo t 

with Ylie&s "^^i^) ~ 1) '''"'^(E) > 0, e G Correspondingly, we define the set of environment traces E with 
time proportions vr*^ , e G £" as 

e G ^) = |e = {e{t),t > 0} : lim io Me{z)=z}dz ^^j) 

I t~)-QO t 

and restrict our attention in this paper to environment traces that have well-defined time proportions. Finally, 
E = 1 naturally corresponds to the degenerate case of a constant (non-fluctuating) environment. 

When the environment is in state e G a (nonempty) set of service vectors S'^ becomes available to 
the system manager, who can select a service vector 5 G 5^ at any point in time to operate the system. Each 
5 G 5^ is a Q-dimensional vector 

5 = (5i,S2,...,5„...,5q) GM«, 

where S'g G M is the drain (or fill, see below) rate of queue q when the service vector S is used. For example, 
in a simple system with two queues {Q = 2), a service vector = (1.35, 2.17) would serve (drain) queue 
1 at rate 1 .35 and queue 2 at rate 2.17 (work units per time unit). This is the standard way of viewing service 
vectors. 

In this general model, however, we also allow for negative 'service' rates, actually coixesponding to 
traffic workload 'feed' rates, as explained below. In the previous simple example of two queues, a service 
vector S"^ = (1.2, -0.8) would serve (drain) the first queue at rate 1.2, but feed workload to the second 
queue at rate 0.8, filling it up. 

The motivation to allow for negative components S'g < in the service vectors 5 G iS*^ comes 
from the need to model environmental (background) cross-traffic sharing the queue buffers with the primary 
(foreground) traffic {A{t),t > 0}. This cross-traffic depends explicitly on the service vector S G 5*^ used, 
and implicitly on the environment state e £ £ through the set S'^ where the service vector S should be 
chosen from. When service vector S is used with Sq < for some queue q £ Q, this corresponds to 
cross-traffic workload fed into queue q at constant rate —Sq>0, in addition to the primary traffic workload 
{Aq{t), t > 0}. It is easy to see that —Sq > can be interpreted as the 'net' cross-traffic through the queue; 
that is, workload could be fed into queue q at rate ri > and removed (served) at rate r2 > 0, with the net 
cross-traffic load fed into the queue being —Sq = ri — r2. 

One special case related to this model is a feed-forward network. A service vector representing the 
transfer of workload from one upstream queue g" to another downstream queue q'^ would be represented 
with Squ = —Sqd and all other Sq = 0. The model here could handle the aggregate of many transfers, 
as well as gain and loss in the system at any queue. The concept of cross-traffic considered here is more 
general, requiring no restrictions on the physical structure of the network. Feed-forward networks require 
some additional assumptions and are not the primary focus of this paper, but have been studied extensively 



elsewhere, such as |Dai and Lin, 2005 |. 
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Note that the above environmental cross-traffic is far less 'innocuous' than simply allowing the pri- 
mary traffic to be modulatej^l by the environment state. Indeed, the cross-traffic depends on the choice of 
service vector 5, hence, the scheduling decisions actively influence it. The environment plays only a sec- 
ondary role by defining S'^, hence, restricting the range of scheduling choices. Actually, the introduction 
of cross-traffic is shown to have significant implications on the stability behavior of the scheduling policies 
studied later. 

The sets S'^,e S £ may be overlapping, that is, a service vector may be available under one or more 
environment states. Let S = IJeG£ assumed that each service vector set S'^, e £ £ is complete, that 

is, for each e G and any q £ Q 

82-, Sq-l,Sg > 0, Sq+l, Sq) G 5*^ {81,82, 8q-l,8q = 0, 8q) G 5^. (2.4) 

Hence, any 'sub-vector' of a service vector in S'^ (i.e. with one or more positive components reduced to zero) 
is alsc|^ a service vector in iS*^. The reason for requiring completeness of each 5^ is to accommodate the 
following situation: when some queues become empty and ceases receiving service, the resulting effective 
service vector is a feasible one. Under the latter perspective, the imposed assumption (I2.41 i is a natural one 
indeed. As seen below, it allows us to naturally handle schedules which provide zero sei-vice rate to empty 
queues. 

The key issue is choosing the service vector 8{t) G 5*^^*^ at time t, when the environment is in 
state e{t) and the vectors 5*^*^*^ are available to choose from. In general, the decision can be based on the 
observable histories of the workload {X{z),z < t}, the environment {e{z),z < t}, and prior service 
choices {8{z), z < t}. The scheduling policy is the overall trace of sei-vice vector choices S = {8{t), t > 
0}. Our primary objective is to design schedules S which maximize the system throughput (keep the system 
stable under the maximum possible load p), while being robust and utilizing minimum information, like the 
current workload and environment states, with no knowledge of the actual load p and the environment time 
proportions {vr'^, e G £}. We elaborate on such issues later. 

We are interested in natural schedules S = {8{t),t > 0} that never apply positive service to empty 
queues. That is, whenever Xq{t) = the scheduler chooses a service vector 8{t) G 5'^^*) with 8q{t) < 0. 
This is possible because we have assumed that the sets 5^ , e G £" are complete. Therefore, we can write 

X{t) = X{0) + / A{z)dz - [ 8{z)dz (2.5) 

without having to explicitly 'compensate' for any idling time. 

^Actually, the environment could also modulate the primary traffic trace {A{t),t > 0} in the following sense. There is a 
collection of traffic traces {A"^ {'t),'t > 0} one for each environment state e £ £. When the environment is in state e, the traffic driven 
into the system is selected from {A'^{t), t > 0}. Therefore, the overall traffic trace is simply {A{t) = JZeef ^'^(^)l{e(t)=e}, i > 
0}. Hence, this basically reverts to the standard model (as long as the limit limt^.oo Jq A{z)dz/t exists) and this is why we do not 
treat this case explicitly. 

*Note that if any service vector in S"^ has no negative components, then the zero vector (0, 0, 0) must be in .S*^ as a sub-vector 
of the former vector, due to completeness. But if each service vector in 5*^ has at least one negative component, the zero vector 
does not necessarily have to be in .S*^ unless it is by design. 
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3 The Stability Issue 



In the interest of robustness of the results, we employ the 'lightest' possible (see below) concept of stability, 
that is, rate stability |Bambos and Walrand, 19931 - Specifically, we call the system stable iff 

,i„> = f , ^ifi) . . ^aWU 0. (3.1) 

t-^oo t t-i-OO \ t t t t J 

Note that from (12.51 ) and (12. It . rate-stability implies that p = lim(_!,oo{/o S{z)dz/t}. Moreover, when the 
traffic trace involves pure 'job-arrivals' (5-jumps) with zero workload arrival rate between them, then rate- 
stability (13.1b imphes that the long-term job departure rate from each queue is equal to the long-term job 



arrival rate [ Armony and Bambos, 2003] . Therefore, there is flow conservation through the system and the 



inflow at each queue is equal to the outflow. On the contrary, when the system is unstable there is a inflow- 
to-outflow deficit, which accumulates in the queues. This is consistent with engineering intuition and, in 
that sense, the concept of rate-stability is quite natural. Of course, it can be further tightened by imposing 
progressively heavier statistical assumptions on the traffic and environment traces. We resist doing that 
at this point, in order to preserve the generality of the results and keep them as robust and 'assumptions- 
agnostic' as possible. 



Definition 3.1 (Stability Region) We define formally the stability region TZ of the system as the set of 
traffic loads p € M^_^ for which there exists a scheduling policy S = {S{t),t > 0} under which the system 
is rate-stable (13. Il l for all traffic traces A = {A(t),t > 0} with p(A) = p and all environment traces 
E = {e{t),t > 0} with 7r'=(E) = vr^ e G £. 

As shown below, the universal stability region TZ can be characterized as 

7^(5^7r^e G £) = 

JpeM'^:0<p<^7r^^ 0^5, for some > with ^ 01 = 1, e G f I (3.2) 



The intuition is that p is in the stability region TZ if it is dominated (covered) by a convex combination 
of the service vectors S* € 5, induced under the various service vectors in 5*^,6 € £. Thus, TZ is the 
'weighted sum' of the various 'stability regions' generated by the individual sets S'^ for each state e G £" of 
the environment. 

If p GTZ and vr^ , e G £" were known in advance and 4>g could be computed, then selecting each mode 
S £ S'^ for a fraction (/)| of the time while the system is in environment state e G 5 would keep the system 
stable. This could be achieved through round-robin or randomized algorithms. A scheduling algorithm 
which maintains stability (13.11 ) for any p G 7^ is referred to as throughput maximizing. However, we are 
primarily interested in adaptive scheduling schemes which maintain stability (13.21 ) for all p G TZ, without 
actual prior knowledge of p or vr*^. The cone schedules defined below are shown to provide such universal 
stability for any traffic load in TZ, while being agnostic to particulars of the traffic and environment traces 
p(A) and vr^(E) = vr^, e G <5; they respond only to current workload and environment state. 

In general, the stability behavior of scheduling rules could require the arrival trace to satisfy stronger 
conditions than those above. For example, restricting the study to Markovian or stationary anival processes. 
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or disallowing mixing, may provide special cases of stability. Instead, we allow the arrival traffic trace A 
and environment trace E to be designed by an adversary to stress the system. Consider for example an 
arrival trace where arrivals to queue q are deliberately correlated to the environment states when q cannot 
be served at maximum capacity. Even further, an adversarial trace may push anivals to queues in a state- 
dependent way which responds to the scheduling rules themselves. These are very difficult to capture by a 
natural probabilistic framework, but are simply treated as particular traffic traces here. 

To motivate the definition of the stability region for the processing system under consideration, we ex- 
amine first the case where Sq > for all g S Q and there is only one environment state (E = 1, no environ- 
ment fluctuation); that is, service is always non-negative and all service vectors S are available at every point 



in time. Under the trace-based perspective employed in this paper, it is known | Armony and Bambos, 2003 | 
that for any load p in the region 

I p G R^^ : p<Y^ (psS for some (ps>0, S £S, such that ^(p'^ <l> 
I Ses SeS J 

the system can be made rate stable with an appropriate scheduUng rule. The non-negative parameters 
4>s, S G 5 are essentially proportional weights, which are chosen so that the load vector p is component- 
wise dominated by the weighted linear combination J2seS 'i^^^ sei-vice vectors. 

Extending this 'geometric' stability perspective to allow cross traffic and varying environment states 
is not a trivial task. Intuition may suggest that the stability region in networks in fluctuating environments 
should be reduced according to how often each mode is available. Consider the following simple network 
to illustrate that the distribution of environment states {vr*^, e G £"} is critical to stability. Take a 2-queue 
network with three service vectors, = (1, 0), S*^ = (0, 1), = (1,1). Cleai^ly, if all vectors are available 
all the time, by employing always the system can accommodate any input vector {pi, P2) G [0, 1]^. On 
the other hand, if there are two environment states £ = {ei, 62} with service vector sets = {S^, S^} 
and = {53} ^j^j^ ^ei ^ q 5^ ^ea ^ 0.5, then the system can accommodate any input vector p > 
satisfying the conditions pi + P2 < 1-5, pi < 1, and /92 < 1- 

However, a different configuration of the service vector sets, say S'^^ = {S^} and = {5^, S^} 
with vr'^i = 0.5, vr^^ = o.5, yields pi G [0, 1] and p2 G [0, 0.5] for stabihty. Note that although the sets S''^ 
and 5*^2 ensure that each service vector is available for the same portion of time in both scenarios, the relative 
combinations of the available service vectors change the stability region. We illustrate (and generalize) this 
perspective in Figure [T] 

We establish first that if p ^ TZ, it is impossible to maintain stability and flow conservation in all 
queues, no matter what scheduling policy one employs. At least one queue will suffer an outflow deficit 
(compared to its inflow), which will accumulate in the queue and cause its workload to explode linearly it 
in time. 



Proposition 3.1 (Instability) For any arbitrarily fixed traffic traffic trace A and environment trace E, we 
have 

X (t) 

p(A) ^ limsup— ^ > 0, (3.3) 

t— i>oo t 

for at least one queue q £ Q under any scheduling policy. 



Proof: For convenience, we drop the fixed argument A from p(A) and write it traffic load as simply p, 
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Figure 1 : The stability region. The set of allowable anival rate vectors p is called the stability region TZ. 
Two sepai^ate sets of sei^vice vectors are shown in the first two plots, with their respective stability regions 
if they were the only environment state, and available 100% of the time. The third plot shows the stability 
region when vri = 0.8 and tt2 = 0.2. This corresponds to the environment state fluctuating so that 80% 
of the time, the service vectors from the first group are available, and 20% of the time the service vectors 
from the second group ai^e available to be scheduled. For any p in the region TZ above, there is a convex 
combination of service modes within the resource sets which would apply a total service rate to each queue 
which is at least the arrival rate to that queue. For p outside TZ there is no such combination. Service modes 
S"^ and 5^ are strictly dominated by a convex combination of other service vectors and therefore do not 
contribute to the stability region (and in fact need not be utilized to maintain stability). Service vectors with 
negative components such as and above may contribute to the stability region without being inside 
the stability region itself. The stability region for the combination of environments can be seen to be the 
weighted sum of the two original stability regions, with care taken to the impact of extreme points with 
negative components. 
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and proceed by contradiction. If (13.31 ) does not hold, then from (12.51 ) we must have lim^^i^oo — ~ 
limt_>oo "^l^)*^^ _|_ linif_j.oo = P- But then we have 

t— >oo t 

^ /o Eegg Eggg^ Ie(z)=e,5(z)=5'S'^^ 

where = limt_>oo satisfies > and Z^5g5e = 1, which satisfies (13.21 ). We then 

easily get (arguing by contradiction) that limsupj_^(^ X''(t)/t > for at least one queue g G Q. ■ 



4 Cone Schedules and their Geometry 

We focus in this paper on schedules that ai^e workload-awai^e and resource-awai^e but not rate-awai^e; that 
is, the system's operator can observe and respond to both the environment state e{t) and the workload state 
X{t), but has no knowledge of the long-term load vector p and state probabilities tt^. 

In particular; we examine a family of resource allocation policies that ai^e called Cone Schedules and 
are parameterized by a fixed matrix B. These schedules select the service vector S € 5^^*^ that has the 
maximal projection on BX(t), when the workload state is X{t) and the environment state is e{t) G £. 
Specifically: 



Definition 4.1 (Cone Schedules) Given a fixed Q x Q real matrix B, a cone schedule is one that, when the 
environment is in state e ^ £ and the the workload is X € it selects a service vector S'^{X) in the set 

S%X) = arg max (S, BX) = {S e : (s, Bx) = max (S, BX)} (4.1) 

which satisfies Sq = whenever Sq = 0. We show that such a vector must be contained in S'^ by proposition 
14. II below. The set S'^{X) C 5^ is nonempty, but may contain several service vectors in .S*^, in which case 
one is arbitrarily chosen by the cone schedule. Note that 



S^(X),BX) = max (S,BX) , (4.2) 

so the chosen S'^{X) is one of maximal projection on BX amongst those in .S*^. Therefore, the service 
vector S{t) chosen by the cone schedule at time t > is 

S{t) € S^^^^X{t)) = arg max {S,BX{t)) , 
based on the observed current workload X{t) and environment state e{t). 
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Notice that the maximization {S,'BX) = J2q ^q(.^-^)q ensures that cone schedules follow some 
important intuition for a scheduling rule. We see that (BX)q is increasing in Xg and decreasing in Xp for 
p ^ q- This will increase whenever Xg comes to dominate other queues. By maximizing this sum, the 
cone schedules all prefer large positive sei^vice rates Sg whenever {'BX)q is large and positive. Thus the 
schedules will prefer remove the most workload from the longer queues, and restrict the cross-traffic added 
to those longer queues. The relationship to performance and load balancing is discussed in section |6] 

Proposition 4.1 (Matrices B with Negative or Zero Off-Diagonal Elements) If the cone schedule ma- 
trix B = {Bij,i,j € Q} has negative or zero off-diagonal elements (Bjj < 0,i 7^ j) and the service 
vector sets are complete for each environments state e € £", then there must exist some S^{X) G S^{X) 
for which we have 

for each q ^ Q. Thus, for such B matrices, the corresponding cone schedules can always select sei^vice 
vectors that provide no positive service rate to an empty queue. 

Proof: Given a workload vector X such that = for some (empty) queue g G Q, let us examine the 
inner product maximized by the cone schedule (14.11 ) in selecting S'^{X) G S^{X), that is, 

(5,BX) = = 5^{5i5,,Xi + 5i( Bi,Xj)}. (4.3) 

with S ^ S'^. Consider the term corresponding to the empty queue q in the above sum, that is, 

SgBggXg + Sg{ ^ Bg,X,). (4.4) 

jeQ-{g} 

Since Xg = 0, the first term above is automatically zero, iixespectively of Sg and Bgg. However, since 
Bqj < and Xj > for each j G Q — {g}, we see that 

Bg,X,<0. (4.5) 

ieQ-{g} 

Arguing by contradiction, assume that {Si, S2, Sg^i, Sg > 0, Sg^i, Sq) G 5^ maximizes 
(14.31) with Sg > 0. But because S'^ is assumed to be complete (12.41) . the vector (S'i,S'2, Sg-i, Sq = 
0,Sg^i, ...,5'q) also belongs to S'^ and has Sg = 0, hence, leads to an equal or greater value of (14.31 ) 
because of (14.4b and (14.51 ). This establishes a contradiction and implies that the set of service vectors 5 that 
maximize (14.31 ) must always include one where Sg < (provide no positive service rate) for each empty 
queue q ^ Q (that is, with workload Xg = 0). 



To justify the term 'cone' schedule consider the following perspective. Define first the set of work- 
loads X for which the cone schedule would choose the service vector 5 when the environment is in state e, 
that is: 

C| = |x G <^ : {S, BX) = rnax (5', BX) ] 



11 



for 5" € 5^, e € This is simply the set of workloads X that have maximum projection on 5 € 5^ 
amongst all other sets in S'^. Note that C| is a geometric cone because (5, BX) > (5', BX) implies that 
(5, BaX) > (5', BaX) for any positive scalar a G R+ and S, S' G 5^ Thus, if X belongs to C| then any 
up/down-scaling aX also belongs to it. 

For each environment state e ^ £, the cones Cg,S ^ form a partition of the workload space, that 

is, 

U C| = M^+. 

In general, some cones may actually be degenerate (like those corresponding to service vectors in 
that are fully dominated component-wise by others in S^) and several cones may share common boundaries. 
Observe that the cone schedule can now be geometrically defined as follows: 

When the environment state is e and the workload X G C| => choose S^{X) = S £ S^. 
The cone structure of the sets C% motivates the name cone schedules. 




Figure 2: The cone schedules assign a service vector from by identifying the location of X with respect 
to the cones formed by C|. This figure shows the cone structure for a system with Q = 3 queues and 4 
service vectors for this particular environment. When X is in cone Cg, then service vector 5 con^esponding 
to that cone is used. The vector X will fluctuate within R^, switching between service vectors when the 
arrivals and departures cause X{t) to cross a cone boundary, or when the environment state changes. The 
cone boundaries are influenced by the environment state and the matrix B. 

When the environment is in state e G £" and the workload X is in the interior of the non-degenerate 
cone Cg, then the only sei^vice vector that can be used by the cone schedule is 5 G 5*^. However, if X is on 
the boundary of several adjacent cones (for example, X G C|i f] Cg2 f] ^^53)' "^hen any of the service vectors 
corresponding to these cones can be used (S^, or 5^, or S^). Therefore, given a workload vector X, we 
want to define the cone it belongs to, which consequently specifies what service vector the cone schedule 
ought to use. We proceed in this direction below. 

To take another perspective, recall that when the environment is in state e G £" and the workload is 
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X, then the cone schedule chooses a service vector in the set 

S^{X) = 1^ G 5" : (^S,BX^ = max(5,BX)| C 5"; 

any vector is arbitrarily chosen, if there are more than one vector in the set S'^{X). When X is in the 
interior of the (non-degenerate) cone C|, then iS^(X) = {S} is a singleton and S'^{X) = S. This follows 
since the interior of a cone denotes all workload vectors X for which the inner product {S, BX) is uniquely 
maximized by S. 

To cover the general case of X being on a cone boundary (perhaps, a common boundary of several 
cones), we define the 'surrounding' cone of the workload vector X as 

C%X)= [j c% 

For example, if X is on the boundary of C^i and C^2 only, then C'^{X) = C^i [JC^2- Note that the above 
definitions lead to the following equivalence 

C%X) C C%Y) ^ S%X) C S%Y), 

as well as 

C^{X) C C^{Y) ^ X G C^{Y) 
for any two workload vectors X,Y ^ M^, and environment state e G £. This is illustrated in Fig. |3] 




Figure 3: For workload vectors X which lie precisely on the boundary of two or more cones, the cone 
is the union of all of the cones in which include X. In contrast to Fig. |2l where X was interior 
to a single cone, the above illustration shows X at the boundary of 3 of the Cs cones. In this case C{X) = 
Cgi U C52 U CgA includes all the elements of the three different cones. This definition is important in the 
proof because it captures the workload vectors which share an optimal service vector with X. 

Note that if y G C^{X) then there must exist a service vector S* G 5"^ for which both (S*, BX) is 
maximized at S and (S*, 'BY) is maximized at S, and if y ^ C^{X) then no such vector can exist. 

We observe that X cannot be on an interior boundary of C'^{X) (the only boundary it could be on is 
where the cone meets an axis because of the non-negativity constraint). If X were on an interior boundary 
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X 



1 



Figure 4: The cone C{X) over environments £ is illustrated. Here, X is in the cones C^^{X) and C^'^{X) 
for the two environments ei and 62. The cone C{X) is the intersection of both of those cones. Since X is 
known to be on the interior of each cone, X is also on the interior of C{X). 



then there must exist a direction vector 5 7^ for which {X + \5) > and {X + X6) ^ C'^{X) for an 
arbitrarily small positive scalar A. This means that there exists some service vector E for which 
{S^,B{X + A5)> > {S,B{X + X6)) for all S G 5^ But since ^ we also have {S\BX) < 

{S, BX) for all S G 5^ This leads to the inequality 



Since the left hand side can be made arbitrarily small this leads directly to a contradiction and we 
conclude that X is indeed on the strict interior of C^i^X). This observation becomes critical in the proof of 
stability. 

Finally, we define the cone around X with respect to all environment states e G £" as 



The cone C{X) is illustrated in Fig|4]is of course non-empty because X belongs to each cones C^(X), e £ £. 
This is the cone of workloads Y for which, at each environment state e £ £, the cone schedule could have 
selected for Y the same service vector as for X (fixed), that is. 



hence, when Y eC{X), then for each e G £" we have S%Y) G S%X), besides S%X) G S%X) of 
course. Viewed another way. 




> 0. 



c{x) = n c%x). 



C{X) = {Y e : 5"(y)p|5^(X) / 0, for each e G £}; 
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C{X) = {Y e M^^ : (^S%Y),BX^ = max {S, BX) for each e € £}, 

that is, when Y € C{X), then for each e € £^ we have that S^{Y) has maximal projection on BX, 
besides also having maximal projection on BY (by definition). 

We note that since X is strictly on the interior of each cone C'^(X) and there are finitely many envi- 
ronment states in E then X is strictly on the interior of C{X). 

The cone C{X) turns out to be of key importance in the stability proof below. This completes the 
geometric picture of cone schedules. 



5 Universal Stability of Cone Schedules 

We are primarily interested in the throughput maximizing properties of cone schedules for various families 
of matrices B, given the traffic load p. The following theorem establishes that stability can be maintained 
for any p £ TZhy rich families of matrices B. 

Consider a cone schedule generated by the matrix B and operating on any arbitrarily fixed system S 
chosen from the class S of processing systems defined by: 

1. some set of queues Q and some set of environment states 6, 

2. some environment trace E G i2;(7r^, e S as per (12.31 ). 

3. some (non-empty) service vector sets 5^, e G £" that are complete, as per (I2.4I ). 

4. some traffic trace A = {A{t), t > 0} G 2l(/9) with load p(A) = p, as per (l2!2l ). 



Theorem 5.1 (Universal Stability of Cone Schedules) Given the above assumptions if B is positive-definite, 
symmetric and has negative or zero off-diagonal elements {Bqp < 0.,p q £ Q), then 

X(t) 

p(A) G 7^(5^ vr^ e G £■) ^ lim = (5.1) 

t^-oo t 

universally on © . That is, each system in © is (rate) stable under such a cone schedule, when p{A) G 
7^(5^7^^e G £). 



It turns out that B being positive definite and having nonpositive off-diagonal elements are both 



necessary for universal stability, which was shown in | |Ross and Bambos, 2009 1. 



To see why nonpositive off diagonal elements are required, consider a simple network with Q = 2 
queues and E = 1 environment state, where B = [2, 1; 1, 2] is used. If 5^ = (1,0) and 5^ = (0, 3) are the 
two available service vectors then BX) = 2Xi + X2, and {S^, BX) = 3Xi + 6X2. Since {S'^,BX) 
strictly dominates (5^, BX) for any nonzero workload, would never be selected and any arrival process 
with pi > will be unstable. 
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To see why positive definiteness is required, consider a simple network with Q = 2 queues and 
E =1 environment state, where B = [1, -2; -2, 1] is used. Let = (1, 1), S"^ = (0, 0), = (1, 0) and 
5^ = (0, 1) be the available sei-vice vectors. Then we have {S^,BX) = -Xi- X2 <0 = (5^, BX), and 
will never be selected. The effective service rates applied to the queues Sq = limt_j>oo j Sq{z) must 
then satisfy Si + S2 < 1. Now p = {pi, P2) with 0.5 < < 1 is contained within TZ by (13. 2I ). but rate 
stability cannot possibly be achieved in ( 12.51 ). The parameters of the non-positive-definite matrix B cause 
the cone schedule to avoid utilizing S^, which is critical for rate stability because it lies on the convex hull 
of 7^. 



5.1 Proof of Theorem O 

We prove rate stability via a sequence of intermediate steps. 

Consider any arbitrarily fixed environment trace E = {e{t),t > 0}, such that 5*^ is complete and 

lim ^i-(-)-}^" = 
t 

for each e G <S. Consider also any arbitrarily fixed traffic trace A = {A{t),t > 0} satisfying 

lim !oA^ = p(A) G 7^(5^ ^^ e G £). 

t—>-oo t 

We note that while A and E ai^e fixed, they can be generated ai^bitrarily, including by an underlying 
stochastic process or an adversary. Recall that by Proposition ( 14.11 ) when B has negative or zero off-diagonal 
elements the generated cone schedule applies no positive rate to empty queues. Therefore, 

X{t) = X{0) + / A{z)dz - [ S{z)dz (5.2) 

JQ Jo 

for the workload X{t) at time t - as in (12.51 ) - without having to compensate for any idle time. 

Proposition 5.1 Under the conditions conditions of Theorem 15.11 the service vectors S'^{X) G S'^{X) 
selected by the cone schedule under various environment states e G £" satisfy 

(p,BX) < J]^^(5^(X),BX) = J]7r'^max(5,BX). (5.3) 

for each workload X G 

Proof: First, choose any workload X and fix it. Since p G 7^, we have p < J2ee£ JlseS" '^S'^ according 
to (1221), or 

0<Pq<^n^ (psSq, for each q £ Q, (5.4) 

for some positive weights 0| > such that X]5e5<= 'Ps — ^■ 

We denote Vq = {'BX)q and note that this may be negative for some g G Q. We examine, the 
following two cases: 
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1. lfvq = iBX)q > 0, we get from (El that 



PqVq <J2^'' <l^%SqVq. (5.5) 

2. If = (BX)g < 0, we have 

PqVq < 0, 

since Pq > 0. 
Combining the two cases, we get 

PqVq < X] "/"sSgMv.S.m^l ^Y^^Yl ^sSq'i-{v,>0}hs,>0}Vq 

ee£ SeS<= e&S S&S" 

for q & Q. Adding the terms up over g € Q, we get 

{p,v)<Y,^' rs{V{S),v) (5.6) 

where V{S) = (5gl{„^>o,5^>o}), (7 G Q) is the vector generated by the service vector G 5^ by setting 
the components Sq for which Vq < and S'g > 0. 

Now recall that for each e € £" and S € 5^, V^(S') is a sub-vector of 5 (dropping some positive 
components to 0) and is also in 5^ because the latter set is complete. But the service vector S^{X) selected 
by the cone schedule (14.21 ) has the maximal projection onv = BX amongst all those in S'^, so {V{S),v) < 

S^{X),v) for every 5 G S^. Therefore, (I5.6l i becomes 



{p,v) <Y^n^ ^s{ViS),v) <Y^n^ 
see S€S^ ee£ 



where the last inequality holds because J2seS''' 't's — ^ e.a.ch e £ £. Putting back v = BX, we get 



{P,BX)<Y^"{S%X),BX), 



which completes the proof. 



X(t) f* S(z)dz 

Lemma 5.1 We have that limt-j.oo —^-^ = implies limt_s-oo — — = P- That is, the long-term applied 
service rate is equal to the long-term traffic load, when the system is (rate) stable. 

Proof: This is immediately obtained by dividing (15.2b by t and letting t^oo. ■ 



Lemma 5.2 Consider two arbitrarily fixed, increasing, unbounded time sequences {tn}^=i and {snjJ^Li 
with Sn < tn for each n > 1. If lim„_j.oo = (or equivalently lim„_j.oo 1^ = 1), then 

^. X{tn)-X{Sn) X{tn)-X{Sn) ^ 

lim = lim = 0. 
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Proof: Note that < f^" Ij^^^-j^^jd-z <tn — Sn for each S € S = Uee£'5^. Dividing by tn and taking the 

r'" 1 - _ dz 

Umit as n^oo, we get liiiin^oo " {■s(")=^> = o. Recalling that 



- X{Sn) = I " A{z)dz -Y.^ r 1{5(.)=5}^^' 

dividing by i„ and letting rn^cxo, we get 



s&s 



n— >cjo n— !>oo tn n— >-oo t„ 

= lim it^^ - lim i^t^^ ^ - 

n— >oo n->-oo s„ tn 

= (5.7) 
Moreover, lim„_^oo = lim„^oo = 0. This completes the proof. ■ 

Lemma 5.3 Consider an arbitrarily fixed, increasing, unbounded time sequence {tn}'^^i- The following 
result then holds: 

n— i-oo tn 

Proof: Clearly the result holds at times t when A{t) is finite. The issue arises at times t„ when A{tn) 
has a J-jump and the workload suddenly shifts by a finite amount, which may actually be increasing in 
consecutive jumps. Let tn be the time of a job arrival to queue q G Q, where jn the index of that job and 

(7g the workload added by the job. It is then sufficient to show that lim„_j.oo -f- = 0. Indeed, note that 

a{" = / Aq{t)dt - / Aq{t)dt (5.8) 
Jo Jo 

Dividing by t„ and letting n— )-oo, we have lim„^oo = Pq — Pq = 0, which proves the lemma. ■ 



5.1.1 Building a Contradiction. 



X(t) 

The objective of the proof is to show that limt_!.oo = 0, when p G TZ. Since B is a positive-definite 
matrix, it is sufficient to show that \mit-^oo B^^^ = 0. 

The proof proceeds by contradiction. Assume that lim sup4_j.o(3 \^^) ^^^1^ / ^ {'ta}T=i 



be an increasing unbounded time sequence on which the supremum limit is obtained; let 

lim = ^ ^ (5.9) 

a->oo ta 

be the corresponding limit. Such a convergent subsequence must exist by the compactness (since bounded) 
of the set of possible valued for at large times. We will construct a related unbounded time sequence 



'For any arrival trace, we have X{t) < A{s)ds, which implies that ^ < i^^^ 
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{sd}T=i and show that it has the property linirf^oo B^^^ > lima^oo {|^|^, B^f^^ > 0. The 

existence of such a sequence will contradict that the supremum limit is attained on {^alj^i- 

We establish the required contradiction by finding an increasing unbounded subsequence {tc}^i of 
{ta}'^i, and a related sequence {sd^i, which satisfy the following two Key Properties: 



I. limc_>.oo = e € (0, 1) and Sc < tc for each c. This implies that liiiic^oo = 1 ~ 

II. C{X{t)) C C{r]) for all t G {sc, tc] and each c. This implies that the workload X(t) drifts within the 
cone C(r/) surrounding t] = limc_).oo ^ " throughout the time interval (sc,tc]- 

The associated intuition is that Sc marks the last time before tc that the workload vector X{sc) (re)enters the 
cone C{7]) and reaches X{tc) ~ rjtc at time tc, drifting in C{r]) throughout the time interval {sc, tc]- 

Before constructing the above sequences with properties I and II we show their implications for 
establishing the required contradiction. 



Lemma 5.4 If the sequences {tc}^i and {sdJ^i satisfy the Properties I and II above, then the supremum 
limit is not attained - as initially assumed - on the sequence {td^i (which is a subsequence of {ta}^^). 
This establishes the targeted contradiction. 



Proof: Since B matrix has negative or zero off diagonal elements, the cone schedule does not apply any 
positive service rate to any empty queue (14.1l i. Therefore, by (15.21 ) we have 

X{tc) - X{sc) = j ' A{z)dz - j ' S{z)dz (5.10) 
and projecting on Br? (15.91 ) we get 



{X{tc) - X{sc),B7j) = ( r A{z)dz, Bi]) - ( r S{z)dz, Br?^ 

\Jsc / \Jsc I 

' t A{z)dz, B7?\ - r {s'iXiz)), Bt?) l{eiz)=e}dz (5.11) 



where S^{X{z)) G S'^{X{z)) for z > 0. But because of Property II above, the workload X{z) drifts in the 
cone C{'q) throughout z € {sc,tc\, which implies that 

(5^(X(z)),B7?) =max(S,B77) 
when e{z) = e £ £. Substituting into (15.111 ) we get 

{X{tc)- X{sc), Bt]) = ^l\iz)dz,Br]'^ - he(z)=e}dz^ max (5, Br?) (5.12) 



Obsei-ve now that 

A{t)dt itAm,. tc !tMt)dt , 
lim = lim lim lim lim 

c->-oo tc — Sc c-^co tc c-s-oo tc — Sc c-s>oo Sc c^-oo tc — Sc 

1 A 

=,--p(--l) 

= p 
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because of Property I above. Dividing [5. 12l by {tc — Sc) and letting c^oo, we get 

lim / ^MzL^iM, B77\ = {p, Br?) - V vr^ max {S, Br?) = -7(r/) < (5.13) 

for 7(r7) > 0. The inequality —7(7?) = {p, Br?) — X^gg^ vr*^ maxsg^e (5, Br/) < is due to (15.31 ). since it is 
assumed that p gTZ. 

Since {tc}J?^i is a subsequence of {ia}^^ we have limc_j.oo ^f"^ = V- Using Property I and (15.131 ) 
we get the following inequality 

\ Sc / c^oo I \ Sg / \ Sc 



lim 

c— >oo 



X{tc)-Xisc) ^^^ 



1 - e " 1 - e 

> (?/,Br?) (5.14) 

The last inequality is due to the facts that e G (0, 1) and 7(ry) > 0. 

By successive thinnings of the components of the workload vector, we can obtain an increasing un- 
bounded subsequence {sdj'^i of {sc}^i such that limd~^oo = and from (I5.14l i 

(V',B?7) > (r?,B?7) (5.15) 

Since B is positive-definite we have {ip — r], B('i/' — r/)) > 0. This implies (-0, B-i/') + {rj, Bry) > {ip, Br?) + 
(r/, Br/i). Since B is symmetric (self-adjoint) we have {r], Bip) = B?/). Therefore, 

(V, Br/.) + (r/, Br?) > 2 (r/., Br?) > 2 (r?, Br?) , 

using (15.151 ) for the last inequality. Thus, (r/;, Br/;) > (r?, Br?) or 

lim /^,B^\ = (r^,Br^) > Br?) = limsup,_ B^\ > 0, 

d^oo \ Sd Sd / \ t t / 

giving a contradiction to the definition of t]. This completes the proof of Lemma [S!4l ■ 



5.1.2 Constructing Sequences with Properties I and II 



It now remains to construct sequences {tc}^i and {sd^i satisfying properties I and II. Their construction 
is based on the intuition mentioned above, which is made formal in the following lemma. 



Lemma 5.5 Suppose limfc_j.oo = r? / for some increasing unbounded sequence {tk}'^i and 

nonzero rj. Let 

Sk = snp{t<tk:C{X{t))<^C{v)} (5.16) 

be the last time before tk that the cone C{X{t)) is not included in C{r]). This is the last time that X{t) 
crosses from outside C{7]) to inside, hence, X{t) G C{ri) for every t G {sk,tk] and the workload drifts in 
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C{r]) throughout that interval. By convention Sfc = if the workload has always been in C(r/) before tk- We 
then have 

lim inf ^II—^ = ei > (5. 17) 

k—^oo tk 

for some ei G (0, 1). 

Proof: Arguing by contradiction, suppose that there exists an increasing unbounded subsequence {tn}'^=i 
of {tfcl^i such that lim„^oo ^'i^ = 0. From LemmaOwe have that hm„^oo ^'^^"'^'^'^^"^ = 0. Since 

limji^oo ^^j^ = 7], we then get lim„_^oo = rj. Further, to allow for the possibility of job arrival 

that instantaneously shifts the workload from outside C(r/) to inside, we note from Lemma 1531 that we have 

limn^oo ^g"" ^ = rj. But according to the definition of s„ the workload X{sn) must be outside C{ri), so 

lim„_j.oo ^^s" ^ could not converge to r]. This establishes the necessary contradiction, showing 15. 171 and 
completing the proof of the Lemma 153] ■ 

We are now ready to construct sequence {sd^i satisfying properties I and II. We rename the se- 
quence defined in (15.16b to be {sc} and choose Sc = max{ Sc, (1 — £2)^0}, for some €2 G (0,1). (The second 
term (1 — e2)tc is used to guard against the degenerate case where Sc is finite because the workload X{t) is 
always in C(ry) after some finite time.) Then we have the properties: 

1. limc_j.oo ^"'^^ = e G (0, 1) and Sc < tc for each (large) c. 

2. C{X{t)) C C{r]) for all t G (sc, tc] and each (large) c. 

This means that {sc}'^i and {tcj^^ satisfy both Properties I and II, and Lemma 1531 completes the proof 
of rate stability in Theorem 15.11 ■ 



6 Performance Issues 



Section [3] established the universal stability of cone schedules for an entire class of matrices B. A natural 
question to consider is how the selection of B from within this class of matrices will affect other performance 
measures such as average workload and waiting time. 



In I Stolyar, 2004 1 it was shown that for a similar queueing system (with one environment state and 
nonnegative Sg vectors), the class of MaxWeight schedules, which are equivalent to cone schedules with a 
diagonal B matrix, will minimize the total workload in the system as well as the holding cost rate asymp- 
totically in heavy traffic. While a formal proof of a corresponding result is beyond the scope of this paper, 
we conjecture that a similar result will hold for these generalized cone schedules. This can be observed by 
considering the limiting behavior of the cone schedules when X is large. 

(X(t+),B,X(t+)> = 

{X{t),B, X{t)) + {A{t) - S{t),B{A{t) - Sm + 2 {A{t),BX{t)) - 2 {S{t),BX{t)) (6.1) 

for t+ > i the workload immediately after time t. If X(t) is large and fixed, then minimizing the 
expectation of the above equation is equivalent to maximizing {X{t),BS{t)), because the first term is fixed 
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by X{t) and no other ternis grow with X{t). This causes us to conjecture that the schedule that minimizes 
limt^oo I /o {X{z),BX{z)) will be the cone schedule with matrix B. A full proof of this optimality would 
require considerably more restriction on the arrival trace than has been presented in this paper. 

From a geometric point of view, the cones shift (and expand or contract), as the weights assigned to 
particular queues are adjusted. Figure |5]illustrates a simple system where the matrix B transforms the cone 
space. The diagonal elements of B expand and contract the cones in the dimension of the corresponding 
queue. The off-diagonal elements move the boundary between adjacent cones where both cones have a 
nonzero service rate to the two corresponding queues. 




Figure 5: The cone space with varying B matrices. The plots above illustrate the impact of matrix B on 
the cone space for a system with Q = 3 queues and Af = 4 service vectors. The first plot shows the cones 
for an identity matrix B. The middle plot has a diagonal B with large positive weight on the first queue, 
focusing more attention on service vectors which serve that queue. The third plot shows the impact of off 
diagonal elements of B. These elements affect the boundary between two queues. 

As highlighted in the stability proof, as X increases in magnitude in any given direction, the service 
vectors applied will rotate through the vectors which maximize [S, BX) for each environment. While in a 
particular- environment, X will be drawn toward the boundary of its current cone. In particular, if no new 
arrivals were allowed and the environment stayed constant, then X would follow a deterministic path to a 
cone boundary. Once at a boundary, the cone schedule would fluctuate around the service vectors which 
are optimal at that boundary and the workload would be drawn down along that boundary. The boundary 
planes act as attractors for the balance of queues in the system. Therefore one can view the matrix B as 
transforming the cone space in order to set the appropriate attracting boundary planes given by the cone 
intersections. This leads to an understanding of B as an important control on the relative importance of 
different workload dimensions. 

Cone schedules perform constrained dynamic load balancing of the queue workloads (weighted by 
the elements of B), observing the service constraints encoded in the service vectors S. As the workload of 
a queue increases excessively, the schedule shifts attention to it and selects available service configurations 
S* G 5"^ that provide more service capacity to that queue, potentially at the expense of others. That lowers 
the workload at the queue, trading it for increased workload in others and load balancing them. 

A strictly diagonal matrix B induces a direct simple priority scheme. That is, as the weight Bgg of 
queue q is increased (while those of others remain constant), the queue attains higher service priority. This 
results in the queue receiving more service bandwidth over time and enjoying a lower workload. 

When B has negative off-diagonal elements, those have an indirect effect on service priorities, entan- 
gling the queues and inducing a coupled priority scheme. That is, when Bpg < with p ^ q, the relative 
priority of queue p decreases as the workload of queue q increases. As Xg grows in size, more attention 
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needs to be paid in servicing queue q, while as Xp grows in size, less attention is paid to queue q. It can be 
seen that the weight Bpg < induces a specific coupling between the corresponding queues. 

We also observe that the proof of stability in section |5] is robust to any sublinear perturbations of 
information or time. In particular, if there is a switching delay between configurations or an informa- 
tion lag in knowledge of X{t), or some error in the calculation of maxs' {S,'BX) then as long as the 
corresponding perturbation does not grow linearly with t, then rate stability will still be assured. See 
|Ross and Bambos, 2009| for more detail on this observation. For example, if any calculation error or delay 
is bounded, then stability will hold. For clarity in the proofs we have not included additional terms, but the 
intuition is that any such sublinear term will have no impact on the limiting case as X{t) becomes large. 

For some processing systems, there can be computational issues in the requirement to calculate 
max5 {S, BX) in real-time over every possible sei-vice vector. The geometric structure of the cone sched- 
ules helps to overcome this by recognizing that when the workload vector X is large, any bounded change in 
workload will move the workload between adjacent cones (adjacent cones have a common boundary when 
3?'^ is divided into the cones Cs)- Therefore, cone schedules can be implemented by evaluating (5, BX) 
over a much smaller subset of service vectors at each point in time. This was shown to be a special case of 
sublinear perturbations in the cone schedules, and discussed in detail in |Ross and Bambos, 2009| . 



To conclude, the adjustment of the Q x Q entries of the matrix B allows for generating a rich family 
of stable service schedules. The dynamic priorities of the queues relate directly to the quality of service 
(QoS) they receive and the workload they see. We are currently exploring such performance issues further. 



7 Conclusions and Further Research 



We have established that the family of Cone Schedules maximizes the system throughput for very gen- 
eral processing systems under very general conditions. These schedules naturally select the best available 
vectors, and this leads to the maximum possible system throughput. Arrival and service processes are as 
general as possible, and the stability proofs are presented with minimal assumptions. Previous stability re- 
sults are generalized here to a setting with generahzed service vectors, fluctuating resource availability and 
continuous time scheduling. 

By exploring the analysis from a geometric standpoint we have gleaned important intuition for stabil- 
ity as well as performance and scalability of the schedules. Further research is necessary to deeper explore 
the effects of changes to the B matrix. The transition between environments is also a topic for further study, 
since it may not be trivial, especially as switching costs become important. 
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